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Mycoplasma californicum is one of several mycoplasmal species associated with bovine mastitis. The complete genome sequence 
of 793,841 bp has been determined and annotated for the M. californicum ST-6 type strain, providing a resource for the identifi- 
cation of surface antigens and putative pathoadaptive features. 
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Mycoplasma mastitis is an increasing problem for the U.S. 
dairy industry (1). Mycoplasma bovis is the most commonly 
isolated species from the group of wall-less pathogens associated 
with mastitis, with Mycoplasma californicum being the second 
most prevalent in some studies (2, 3). In contrast to M. bovis, 
almost nothing is known about the gene portfolio and virulence 
factors of M. californicum. To address these knowledge gaps, the 
genome sequence of the type strain, first isolated from a mastitis 
outbreak in California in 1 972 (4) , was determined and annotated. 

Genomic DNA was prepared from M. californicum ST-6 T (ob- 
tained from the International Organization for Mycoplasmology 
[IOM] culture collection, Purdue, IN) and sequenced using the 
Pacific Biosciences platform at the National Center for Genome 
Resources (NCGR) , Santa Fe, NM. A single SMRT cell was used to 
generate 1 8 1 ,998 reads that were assembled using HGAP version 2 
(5). As a result, 1 large (-790 kb) and two small (-3.8 kb) contigs 
emerged, with 230 X coverage, which were readily closed by PCR 
and Sanger sequencing. The 793,841-bp genome sequence was 
automatically annotated using the PGAP pipeline at NCBI, fol- 
lowing which the resulting open reading frames (ORFs) were 
manually curated. The genome comprises 669 genes with 611 
ORFs, 21 independently verified pseudogenes, 31 tRNAs, and two 
copies of each rRNA (with 5S rRNA genes separated from the two 
16S-23S rRNA operons). The G+C content is 30.84%. 

A motif query for surface lipoproteins disclosed 52 putative 
lipoprotein-encoding genes, including genes for two tandem 
paralogs of P30 (an immunodominant antigen of Mycoplasma 
agalactiae [6] ) and two paralogs of MALP-404 (7). The presence of 
lipoprotein genes encoding nucleases (three predicted), acid 
phosphatase, and 5 ' -nucleotidase, together with a homolog of the 
polynucleotide binding lipoprotein (8), highlights the critical im- 
portance of nucleotide acquisition in the absence of complete nu- 
cleotide biosynthesis pathways. 

The ability of mycoplasmas to undergo high-frequency switch- 
ing of surface antigens has long been recognized (9) and is pre- 
dicted to reflect successful adaptation to the host (10). Interroga- 
tion of the genome for distinctive hypermutable motifs revealed 



that the ORFs for 7 surface lipoproteins were preceded by perfect 
(6 instances) or near-perfect homopolymeric tracts that poten- 
tially orchestrate phase-variable expression resulting in combina- 
torial surface protein arrays (11). 

Mycoplasmas are amino acid auxotrophs, necessitating the ac- 
quisition of such precursors by proteolysis and transport (12). 
M. californicum and the most closely related organism Myco- 
plasma bovigenitalium (13) are unusual among Mycoplasma spe- 
cies in that both possess a gene encoding the tryptophan synthase 
beta-subunit (TrpB). In most organisms that contain TrpB, this 
protein functions as an a 2 fi 2 tetramer with TrpA (14). This mul- 
timer is the best studied example of substrate channeling (15); 
TrpA cleaves indole glycerol 3-phosphate to glyceraldehyde 
3 -phosphate and indole, with indole passing directly through an 
interconnecting tunnel for tryptophan synthesis by TrpB. An or- 
phan trpB gene is unusual in prokaryotes and raises the possibility 
that the organism can synthesize tryptophan from exogenous in- 
dole. 

The annotated genomic sequence is the first for this species and 
expands the data sets available for studying the evolution and host 
adaptation of ruminant mycoplasmas. 

Nucleotide sequence accession number. This complete ge- 
nome sequence has been deposited at DDBJ/EMBL/GenBank un- 
der the accession no. CP007521. 
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